38 research outputs found

    Evolutionary distances in the twilight zone -- a rational kernel approach

    Get PDF
    Phylogenetic tree reconstruction is traditionally based on multiple sequence alignments (MSAs) and heavily depends on the validity of this information bottleneck. With increasing sequence divergence, the quality of MSAs decays quickly. Alignment-free methods, on the other hand, are based on abstract string comparisons and avoid potential alignment problems. However, in general they are not biologically motivated and ignore our knowledge about the evolution of sequences. Thus, it is still a major open question how to define an evolutionary distance metric between divergent sequences that makes use of indel information and known substitution models without the need for a multiple alignment. Here we propose a new evolutionary distance metric to close this gap. It uses finite-state transducers to create a biologically motivated similarity score which models substitutions and indels, and does not depend on a multiple sequence alignment. The sequence similarity score is defined in analogy to pairwise alignments and additionally has the positive semi-definite property. We describe its derivation and show in simulation studies and real-world examples that it is more accurate in reconstructing phylogenies than competing methods. The result is a new and accurate way of determining evolutionary distances in and beyond the twilight zone of sequence alignments that is suitable for large datasets.Comment: to appear in PLoS ON

    A molecular phylogeny of Hypnales (Bryophyta) inferred from ITS2 sequence-structure data

    Get PDF
    Background: Hypnales comprise over 50% of all pleurocarpous mosses. They provide a young radiation complicating phylogenetic analyses. To resolve the hypnalean phylogeny, it is necessary to use a phylogenetic marker providing highly variable features to resolve species on the one hand and conserved features enabling a backbone analysis on the other. Therefore we used highly variable internal transcribed spacer 2 (ITS2) sequences and conserved secondary structures, as deposited with the ITS2 Database, simultaneously. Findings: We built an accurate and in parts robustly resolved large scale phylogeny for 1,634 currently available hypnalean ITS2 sequence-structure pairs. Conclusions: Profile Neighbor-Joining revealed a possible hypnalean backbone, indicating that most of the hypnalean taxa classified as different moss families are polyphyletic assemblages awaiting taxonomic changes

    Internal Transcribed Spacer 2 (nu ITS2 rRNA) Sequence-Structure Phylogenetics: Towards an Automated Reconstruction of the Green Algal Tree of Life

    Get PDF
    L). Some have advocated the use of the nuclear-encoded, internal transcribed spacer two (ITS2) as an alternative to the traditional chloroplast markers. However, the ITS2 is broadly perceived to be insufficiently conserved or to be confounded by introgression or biparental inheritance patterns, precluding its broad use in phylogenetic reconstruction or as a DNA barcode. A growing body of evidence has shown that simultaneous analysis of nucleotide data with secondary structure information can overcome at least some of the limitations of ITS2. The goal of this investigation was to assess the feasibility of an automated, sequence-structure approach for analysis of IT2 data from a large sampling of phylum Chlorophyta.Sequences and secondary structures from 591 chlorophycean, 741 trebouxiophycean and 938 ulvophycean algae, all obtained from the ITS2 Database, were aligned using a sequence structure-specific scoring matrix. Phylogenetic relationships were reconstructed by Profile Neighbor-Joining coupled with a sequence structure-specific, general time reversible substitution model. Results from analyses of the ITS2 data were robust at multiple nodes and showed considerable congruence with results from published phylogenetic analyses.Our observations on the power of automated, sequence-structure analyses of ITS2 to reconstruct phylum-level phylogenies of the green algae validate this approach to assessing diversity for large sets of chlorophytan taxa. Moreover, our results indicate that objections to the use of ITS2 for DNA barcoding should be weighed against the utility of an automated, data analysis approach with demonstrated power to reconstruct evolutionary patterns for highly divergent lineages

    Computermethoden zur Bestimmung von Protein-Ligand Verweilzeiten in bakteriellen Enoyl-ACP Reduktasen und Vorhersage der Permeabilitätswahrscheinlichkeit kleiner Moleküle gegenüber der MycobacteriumMycobacterium tuberculosistuberculosis Zellwand

    No full text
    \textbf{Molecular Determinants of Drug-Target Residence Times of Bacterial Enoyl-ACP Reductases.} Whereas optimization processes of early drug discovery campaigns are often affinity-driven, the drug-target residence time tRt_R should also be considered due to an often strong correlation with \textit{in vivo} efficacy of compounds. However, rational optimization of tRt_R is not straightforward and generally hampered by the lack of structural information about the transition states of ligand association and dissociation. The enoyl-ACP reductase FabI of the fatty acid synthesis (FAS) type II is an important drug-target in antibiotic research. InhA is the FabI enzyme of \textit{Mycobacterium tuberculosis}, which is known to be inhibited by various compound classes. Slow-onset inhibition of InhA is assumed to be associated with the ordering of the most flexible protein region, the substrate binding loop (SBL). Diphenylethers are one class of InhA inhibitors that can promote such SBL ordering, resulting in long drug-target residence times. Although these inhibitors are energetically and kinetically well characterized, it is still unclear how the structural features of a ligand affect tRt_R. Using classical molecular dynamics (MD) simulations, recurring conformational families of InhA protein-ligand complexes were detected and structural determinants of drug-target residence time of diphenyl\-ethers with different kinetic profiles were described. This information was used to deduce guidelines for efficacy improvement of InhA inhibitors, including 5'-substitution on the diphenylether B-ring. The validity of this suggestion was then analyzed by means of MD simulations. Moreover, Steered MD (SMD) simulations were employed to analyze ligand dissociation of diphenylethers from the FabI enzyme of \textit{Staphylococcus aureus}. This approach resulted in a very accurate and quantitative linear regression model of the experimental ln(tR)ln(t_R) of these inhibitors as a function of the calculated maximum free energy change of induced ligand extraction. This model can be used to predict the residence times of new potential inhibitors from crystal structures or valid docking poses. Since correct structural characterization of the intermediate enzyme-inhibitor state (EI) and the final state (EI*) of two-step slow-onset inhibition is crucial for rational residence time optimization, the current view of the EI and EI* states of InhA was revisited by means of crystal structure analysis, MD and SMD simulations. Overall, the analyses affirmed that the EI* state is a conformation resembling the 2X23 crystal structure (with slow-onset inhibitor \textbf{PT70}), whereas a twist of residues Ile202 and Val203 with a further opened helix α6\alpha 6 corresponds to the EI state. Furthermore, MD simulations emphasized the influence of close contacts to symmetry mates in the SBL region on SBL stability, underlined by the observation that an MD simulation of \textbf{PT155} chain A with chain B' of a symmetry mate in close proximity of the SBL region showed significantly more stable loops, than a simulation of the tetrameric assembly. Closing Part I, SMD simulations were employed which allow the delimitation of slow-onset InhA inhibitors from rapid reversible ligands. \textbf{Prediction of \textit{Mycobacterium tuberculosis} Cell Wall Permeability.} The cell wall of \textit{M. tuberculosis} hampers antimycobacterial drug design due to its unique composition, providing intrinsic antibiotic resistance against lipophilic and hydrophilic compounds. To assess the druggability space of this pathogen, a large-scale data mining endeavor was conducted, based on multivariate statistical analysis of differences in the physico-chemical composition of a normally distributed drug-like chemical space and a database of antimycobacterial--and thus very likely permeable--compounds. The approach resulted in the logistic regression model MycPermCheck, which is able to predict the permeability probability of small organic molecules based on their physico-chemical properties. Evaluation of MycPermCheck suggests a high predictive power. The model was implemented as a freely accessible online service and as a local stand-alone command-line version. Methodologies and findings from both parts of this thesis were combined to conduct a virtual screening for antimycobacterial substances. MycPermCheck was employed to screen the chemical permeability space of \textit{M. tuberculosis} from the entire ZINC12 drug-like database. After subsequent filtering steps regarding ADMET properties, InhA was chosen as an exemplary target. Docking to InhA led to a principal hit compound, which was further optimized. The quality of the interaction of selected derivatives with InhA was subsequently evaluated using MD and SMD simulations in terms of protein and ligand stability, as well as maximum free energy change of induced ligand egress. The results of the presented computational experiments suggest that compounds with an indole-3-acethydrazide scaffold might constitute a novel class of InhA inhibitors, worthwhile of further investigation.\textbf{Molekulare Determinanten von Wirkstoff-Angriffsziel Verweilzeiten bakterieller Enoyl-ACP Reduktasen.} In frühen Phasen der Wirkstoffentwicklung sind Optimierungsprozesse häufig affini\-täts\-geleitet. Darüber hinaus sollte zusätzlich die Wirkstoff-Angriffsziel Verweilzeit tRt_R berücksichtigt werden, da diese oft eine starke Korrelation zur \textit{in vivo} Wirksamkeit der Substanzen aufweist. Rationale Optimierung von tRt_R ist jedoch auf Grund eines Mangels an struktureller Information über den Übergangszustand der Ligandbindung und Dissoziierung nicht einfach umsetzbar. Die Enoyl-ACP Reduktase FabI der Fettsäurebio\-synthese (FAS) Typ II ist ein wichtiger Angriffspunkt in der Antibiotikaforschung. InhA ist das FabI Enzym des Organismus \textit{Mycobacterium tuberculosis} und kann durch Substanzen diverser Klassen gehemmt werden. Es wird vermutet, dass Hemmung von InhA durch langsam-bindende (``slow-onset'') Inhibitoren mit der Ordnung der flexibelsten Region des Enzyms assoziiert ist, dem Substratbindungsloop (SBL). Diphenylether sind eine InhA Inhibitorenklasse, die eine solche SBL Ordnung fördern und dadurch lange Verweilzeiten im Angriffsziel aufweisen. Obwohl diese Inhibitoren energetisch und kinetisch gut charakterisiert sind, ist noch immer unklar, wie die strukturellen Eigenschaften eines Liganden tRt_R beeinflussen. Durch die Verwendung klassischer Molekulardynamik (MD) Simulationen wurden wiederkehrende Konformationsfamilien von InhA Protein-Ligand Komplexen entdeckt und strukturelle Determinanten der Wirkstoff-Angriffsziel Verweilzeit von Diphenylethern mit verschiedenen kinetischen Profilen beschrieben. Anhand dieser Ergebnisse wurden Richtlinien zur Wirksamkeitsoptimierung von InhA Inhibitoren abgeleitet, einschließlich einer 5'-Substitution am Diphenylether B-Ring. Die Validität dieses Vorschlags wurde mittels MD Simulationen nachfolgend analysiert. Darüber hinaus wurden ``Steered MD'' (SMD) Simulationen als MD Technik für umfangreicheres Sampling verwendet um die Liganddissoziation von Diphenylethern aus dem FabI Enzym von \textit{Staphylococcus aureus} zu untersuchen. Dieser Ansatz resultierte in einem sehr akkuraten, quantitativen linearen Regressionsmodell der experimentellen Verweilzeit ln(tR)ln(t_R) dieser Inhibitoren als Funktion der berechneten maximalen freien Energieänderung induzierter Ligandextraktion. Dieses Modell kann genutzt werden um die Verweilzeiten neuer potentieller Inhibitoren aus Kristallstrukturen oder validen Dockingposen vorherzusagen. Die korrekte strukturelle Charakterisierung des intermediären und des finalen Zustandes (EI und EI*-Zustand) eines Enzym-Inhibitor Komplexes bei einem zweistufigen Inhibitionsmechanismus durch langsam-bindende Hemmstoffe ist essentiell für rationale Verweilzeitoptimierung. Daher wurde die gegenwärtige Ansicht des EI und EI*-Zustandes von InhA mittels Kristallstrukturanalyse, MD und SMD Simulationen erneut aufgegriffen. Insgesamt bestätigten die Analysen, dass der EI*-Zustand einer Konformation ähnlich der 2X23 Kristallstruktur (mit langsam-bindenden Inhibitor \textbf{PT70}) gleicht, während eine Drehung der Reste Ile202 und Val203 mit einer weiter geöffneten Helix α6\alpha 6 dem EI-Zustand entspricht. Des Weiteren zeigten MD Simulationen den Einfluss naher Kristallkontakte zu Symmetrie-Nachbarn in der SBL Region auf die SBL Stabilität. Dies wird durch die Beobachtung hervorgehoben, dass die Ketten A und B' eines InhA-\textbf{PT155}-Komplexes und des angrenzenden Symmetrie-Nachbars, welche in engem Kontakt in der SBL Region stehen, signifikant stabilere SBLs aufweisen, als die Ketten A und B in einer Simulation des Tetramers. Zum Abschluss von Teil I wurden SMD Simulationen angewandt, auf deren Basis es möglich war, langsam-bindende InhA Inhibitoren von schnell-reversiblen (``rapid reversible'') Liganden zu unterscheiden. \textbf{Vorhersage von \textit{Mycobacterium tuberculosis} Zellwand Permeabilität.} Die Zellwand von \textit{M.~tuberculosis} erschwert die antimycobakterielle Wirkstofffindung auf Grund ihrer einzigartigen Zusammensetzung und bietet eine intrinsische Antibiotikaresistenz gegenüber lipophilen und hydrophilen Substanzen. Um den chemischen Raum wirkstoffähnlicher Moleküle gegen diesen Erreger (``Druggability Space'') einzugrenzen, wurde eine groß angelegte Dataminingstudie durchgeführt, welche auf multivariater statistischer Analyse der Unterschiede der physikochemischen Zusammensetzung eines normalverteilten wirkstoffähnlichen chemischen Raumes und einer Datenbank von antimycobakteriellen -- und somit höchstwahrscheinlich permeablen -- Substanzen beruht. Dieser Ansatz resultierte in dem logistischen Regressionsmodell MycPermCheck, welches in der Lage ist die Permeabilitätswahrscheinlichkeit kleiner organischer Moleküle anhand ihrer physikochemischen Eigenschaften vorherzusagen. Die Evaluation von MycPermCheck deutet auf eine große Vorhersagekraft hin. Das Modell wurde als frei zugänglicher online Service und als lokale Kommandozeilenversion implementiert. Methodiken und Ergebnisse aus beiden Teilen dieser Dissertation wurden kombiniert um ein virtuelles Screening nach antimycobakteriellen Substanzen durchzuführen. Myc\-PermCheck wurde verwendet um den chemischen Permeabilitätsraum von \textit{M.~tuberculosis} anhand der gesamten ZINC12 Datenbank wirkstoffähnlicher Moleküle abzuschätzen. Nach weiteren Filterschritten mit Bezug auf ADMET Eigenschaften, wurde InhA als exemplarisches Angriffsziel ausgewählt. Docking nach InhA führte schließlich zu einer Treffersubstanz, welche in darauffolgenden Schritten weiter optimiert wurde. Die Interaktionsqualität ausgewählter Derivate mit InhA wurde daraufhin mittels MD und SMD Simulationen in Bezug auf Protein und Ligand Stabilität, sowie auch der maximalen freien Energieänderung induzierter Ligandextraktion, untersucht. Die Ergebnisse der vorgestellten computerbasierten Experimente legen nahe, dass Substanzen mit einem Indol-3-Acethydrazid Gerüst eine neuartige Klasse von InhA Inhibitoren darstellen könnten. Weiterführende Untersuchungen könnten sich somit als lohnenswert erweisen

    Slow-Onset Inhibition of Mycobacterium tuberculosis InhA: Revealing Molecular Determinants of Residence Time by MD Simulations

    Get PDF
    An important kinetic parameter for drug efficacy is the residence time of a compound at a drug target, which is related to the dissociation rate constant koff. For the essential antimycobacterial target InhA, this parameter is most likely governed by the ordering of the flexible substrate binding loop (SBL). Whereas the diphenyl ether inhibitors 6PP and triclosan (TCL) do not show loop ordering and thus, no slow-binding inhibition and high koff values, the slightly modified PT70 leads to an ordered loop and a residence time of 24 minutes. To assess the structural differences of the complexes from a dynamic point of view, molecular dynamics (MD) simulations with a total sampling time of 3.0 µs were performed for three ligand-bound and two ligand-free (perturbed) InhA systems. The individual simulations show comparable conformational features with respect to both the binding pocket and the SBL, allowing to define five recurring conformational families. Based on their different occurrence frequencies in the simulated systems, the conformational preferences could be linked to structural differences of the respective ligands to reveal important determinants of residence time. The most abundant conformation besides the stable EI* state is characterized by a shift of Ile202 and Val203 toward the hydrophobic pocket of InhA. The analyses revealed potential directions for avoiding this conformational change and, thus, hindering rapid dissociation: (1) an anchor group in 2'-position of the B-ring for scaffold stabilization, (2) proper occupation of the hydrophobic pocket, and (3) the introduction of a barricade substituent in 5'-position of the diphenyl ether B-ring

    Mechanisms of drug-target complex formation.

    No full text
    <p><b>(a)</b> Equilibria for rapid reversible inhibition via a one-step mechanism and slow-binding inhibition via a two-step induced-fit mechanism along with a schematic free-energy profile for this reaction. E denotes the enzyme, I the inhibitor, EI the initial enzyme-inhibitor complex, and EI* the final enzyme-inhibitor complex. In the case of InhA, the diphenyl ether inhibitors bind to the enzyme with bound oxidized cofactor NAD<sup>+</sup>, forming a ternary complex. The double-headed arrows in the energy profile highlight the importance of the barrier height for the kinetics of the reaction. <b>(b)</b> Schematic free-energy profiles for a slow-binding inhibitor (left) and a destabilized EI* state as a consequence of ligand removal or the presence of a rapid-reversible inhibitor (right). Each macrostate (EI, EI*) is obviously associated with many microstates.</p

    Open and closed conformations of InhA observed in the MD simulations.

    No full text
    <p>Figure (a) shows the closed state represented by the medoid of conformational Family 1, figure (b) illustrates the open state represented by the medoid of cluster 4 (belonging to conformational Family 3). The same view of the binding pocket as in Fig 3 of Li et al. [<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0127009#pone.0127009.ref017" target="_blank">17</a>] is used for better comparison. In this view, the portal-forming elements are located left (helix <i>α</i>6) and right (strand-4) of the binding site. The distances highlighted as yellow dashed lines were measured between Ala198/Ile202 on helix <i>α</i>6 and Phe97 on strand-4. For comparison, in the crystal structure of the <b>PT70</b> complex (PDB 2X23) respresenting the closed state, a distance of 4 Å is found between Ile202 and Phe97, whereas the open state is characterized by a distance of about 10 Å between Ala198 and Phe97 in chain B of the <b>PT155</b>-complex crystal structure (PDB 4OXN) [<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0127009#pone.0127009.ref017" target="_blank">17</a>].</p

    Occurrence frequency (in % of the trajectory snapshots) of <i>α</i>-helix and 3<sub>10</sub>-helix motifs in the substrate binding loop.

    No full text
    <p>Each monomer of the simulated homotetrameric systems (150 ns) was analyzed, and data of the four monomers were combined to one box plot per system.</p

    Overview of the experimentally characterized diphenyl ethers analyzed in the MD simulations.

    No full text
    <p>The phenyl rings are referred to as A- and B-ring, respectively. The corresponding ether torsions are symbolized by curly arrows and labeled <i>α</i> and <i>β</i>. Experimental data were taken from [<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0127009#pone.0127009.ref007" target="_blank">7</a>] and references provided therein. <b>PT70</b> is a slow-onset inhibitor, with measured dissociation rate constant <i>k</i><sub><i>off</i></sub> and residence time <i>t</i><sub><i>r</i></sub>. In contrast, <b>6PP</b> and <b>TCL</b> show rapid-reversible binding kinetics; <i>k</i><sub><i>off</i></sub> and <i>t</i><sub><i>r</i></sub> values were estimated assuming a value of 10<sup>9</sup> M<sup>−1</sup>s<sup>−1</sup> for <i>k</i><sub><i>on</i></sub>, as in [<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0127009#pone.0127009.ref007" target="_blank">7</a>].</p

    2D density plot for the ether dihedral angles <i>α</i> and <i>β</i> of the unbound ligands PT70 (left) and 6PP (right) based on a 150 ns MD simulation in aqueous solution.

    No full text
    <p>The dihedral angles <i>α</i> (C<sub><i>OH</i></sub>-C-O-C) and <i>β</i> (C-O-C-C<sub><i>Me</i>/<i>H</i></sub>) are illustrated in <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0127009#pone.0127009.g002" target="_blank">Fig 2</a>.</p
    corecore